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Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in F 

the application: v 

Listing of Claims: 

1.-20. (Cancelled). I 

L 

21 . (Currently amended) A system for clustering data comprising: I 

a computer executing a computer program performing at least the [ 

following: t 

receiving into the computer a plurality of data points for clustering; r 



receiving into the computer a size parameter for specifying the 
number of data points to be moved — at — ofje 
tim es imultaneously evaluated for inclusion in a duster : 
clustering the data points by using the size parameter to generate 

clustered results; [ 
determining whether the clustered results are satisfactory; f 
when the clustered results are satisfactory, stop clustering; [' 
otherwise when the clustered results are not satisfactory, revise the L: 
size parameter, perform clustering based on the revised size L 
parameter and the clustered results, and proceed to step 
(d^ determininq whether the clustered results are satisfactory . I" 



22. (Currently amended) The system as defined in claim 21 wherein 

clustering the data points further comprises: 

evaluating subsets of data points in each cluster for moving into every 
other cluster by using a predetermined metric; wherein the number 
of data points in the~a.subset is specified by the size parameter 
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23. (Previously presented) The system as defined in claim 22 wherein 
evaluating subsets further comprises: 

determining a geometric center of the subset of data points being 

evaluated for a move; 
using the geometric center of the subset of data points and the 

predetermined metric to generate a value. 

24. (Previously presented) The system as defined in claim 23 wherein 
evaluating subsets further comprises: 

determining whether the value is greater than zero; 

when the value is greater than zero, moving the subset of data points from 

a l\/love_From cluster to a Move_To cluster; 
when the value is not greater than zero, determining if there are more 

subsets to evaluate; 
when there are more subsets to evaluate, proceeding to evaluating the 

subsets; 

when there are no more subsets to evaluate, determining whether any 

point has moved; 
when a point has moved, proceeding to evaluating the subsets; and 
when no point has moved, stopping the processing. 

25. (Currently amended) The ■■ systom — as — defined — in — c l a i m — 24 — furth e r 
compr isi ng:^ system for glusterinq dat^ copiprisipq; 

a computer executing a computer program performing at least the 

folfowina: 

receiving into the computer a plurality of data points for clustering: 
receiving into the computer a size parameter for specifying the number of 

data points to be moved at one time: 
clustering the data points bY .. u^inq the size parameter to generate 

clustered results: 
determining wnetner the ciusterea results are satisfactory; 

128251.03/2102.10500 Page 4 Of 1 4 HP PDNO 10992462-1 



PAGE 27/37 * RCVD AT 7/1 2/2004 3:48:46 PM [Eastern Daylight Time) * 



Sent by: CONLEY ROSE, P.C. 



713 238 8008; 



07/12/04 13:46; Jfi££fia—i¥338; Page 10/19 



AppL No. 09/684,488 

Amdt dated July 12» 2004 ^ 
Reply to Office action of May 17, 2004 

when the clustered results are satisfactory, stop clustering: ^- 

otherwise when the clustered results are not satisfactory, revise the size [ 

parameter, perform clustering based on the revised size parameter [ 

and the clustered results, and proceed to determining whether the [ 

clustered results are satisfactory. f 

wherein clustering the data points further comprises: ^ 

evaluating subsets of data points in each cluster for moving into ^ 
every other cluster bv using a predetermined metric, wherein 
the number of data points in a subset is specified by the size 

parameter. i_ 

wherein evaluating subsets further comprises: i " 
determining a geometric center of the subset of data points 

being evaluated for a move: - 

using the geometric center of the subset of data points and -T: 

the predetermined metric to generate a value: l 

detemnininq vyhether the value is greater than ^ero: L 
when the value Is greater than zero, moving the subset of 

data points from a Move From cluster to a ^ 
Move To cluster: 
when the value is not greater than zero, determining if there 

are more subsets to evaluate: r. 

when there are more subsets to evaluate, proceeding to [- 

evaluating the subsets: 

when there are no more subsets to evaluate, determining [ 

whether any point has moved: \ 

when a point has moved, proceeding to evaluating the P 

subsets: and -I 

when no point has moved, stopping the processing . p- 

wherein each data point has a membership with one cluster; and ^ 

wherein moving the subset of data points from a Move_From cluster to a r 

Move_To cluster further comprises simultaneously updating the ; 
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membership of at least two data points from the membership of the 
Move_From cluster to the membership of the Move_To cluster. 



26. (Previously presented) The system as defined in claim 24 wherein moving 
the subset of data points from a Move_From cluster to a Move_To cluster further 
comprises: 

updating the count of the Move_From cluster; 
updating the center of the Move_From cluster; 
updating the count of the Move_To cluster; 
updating the center of the Move__To cluster, 

27. (Previously presented) The system as defined in claim 21 wherein 
revising the sl^e parameter further comprises decreasing the size parameter 

28. (Cancelled). 

29. (Currently amended) Tho computer readable medium of T he system as 
defined in claim a8-22.wherein the predetermined metric of the computer program 
comprises the following expression: 



where U is the subset of data points being evaluated for the move, | U\ is 
the size of U that is specified by the size parameter, rrtu is the 
geometric center of U. mt and roj are the centers of the clusters and 
Hi and r?y are the counts of the clusters. 



n, -\U\ 



«,-i(7| 
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30. (Previously presented) The system as defined in claim 21 wherein the 
system is utilized in one of a data mining application, customer segmentation 
application, document categorization applicallon, scientific data analysis 
application, data compression application, vector quantization application, and 
image processing application. 

31. (New) The system as defined in claim 21 wherein detenmining v^etherthe 
clustered results are satisfactory further comprises: 

determining whether a change in a performance function is less than a 
predetermined value. 

32. (New) The system as defined in claim 24, further comprising: 
wherein each data point has a membership with one cluster; and 
wherein moving the subset of data points from a Move_From cluster to a 

Move_To cluster further comprises simultaneously updating the 
membership of at least two data points from the membership of the 
Move_From duster to the membership of the Move_To cluster, 

33. (New) A computer-implemented method for clustering data points, 
comprising: 

receiving a plurality of data points in a computer system; 

partitioning the plurality of data points into a plurality of clusters wherein 

each data point is a member of one cluster of the plurality of 

clusters; 

evaluating a plurality of data points in a first cluster of the plurality of 
clusters simultaneously to determine whether the plurality of data 
points in the first cluster should be moved to a second cluster of the 
plurality of clusters; and 

moving the plurality of data points simultaneously from the first cluster to 

the second cluster if the determination is that the plurality of data 
points should be moved. 
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34. (New) The computeNmplemented method of claim 33, further comprising: 
receiving a size parameter that specifies a number of data points, wherein 

the plurality of data points in a first cluster comprises the number of 
data points specified by the size parameter. 

35. (New) The computer-implemented method of claim 34, wherein evaluating 
a plurality of data points simultaneously further comprises applying a 
predetermined metric in the evaluation, wherein the predetermined metric uses 
the size parameter. 

36. (New) The computer-implemented method of claim 35, wherein the 
predetermined metric comprises the following expression: 

wherein U Is the plurality of data points being evaluated. \U\ is a size of U that Is 
specified by the size parameter, mu is a geometric center of U, mi and mj are 
geometric centers of the first cluster and the second cluster, and rti and nj are 
counts of the first cluster and the second cluster. 

37. (New) A system for clustering data points comprising: 

a computer that stores a plurality of data points for clustering; 

means for partitioning the plurality of data points into a plurality of dusters; 

means for evaluating whether an aggregated move of a subset of data 

points in a first cluster of the plurality of clusters to a second cluster 

of the plurality of clusters improves the partitioning; and 
means for performing the aggregated move of the subset of data points to 

the second duster responsive to the results of the evaluation 

mear^. 

38. (New) The system of daim 37. wherein the means for evaluating utilizes a 

pre^^termin^d metric to ^Imultan^oudty evaluate the data pofnts in the eubaet* 
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39. (New) The system of claim 38, wherein the predetermined metric 
comprises the expression: 

wherein U is the subset of data points being evaluated, \U\ is a size of U that is 
specified by a size parameter^ mu is a geometric center of U, Wt and are 
geometric centers of the first cluster and the second cluster, and /)/ and rij are 
counts of the first cluster and the second cluster 

40. (New) The system of claim 37, wherein the means for evaluating receives 
a size parameter that determines the number of data points included in the 
subset. 

41 . (New) The system of claim Z7, wherein the means for evaluating includes 
means for generating a geometric center of the subset of data points based on a 
first plurality of data points in the first cluster and a secorKi plurality of data points 
in the second cluster. 

42. (New) The system of claim 37» wherein the means for perfonming an 
aggregated move includes means to simultaneously move the data points in the 
subset of data points from the first cluster to the second cluster. 

43. (New) The system of claim 37. wherein the means for performing an 
aggregated move includes means to determine a geometric center of the first 
cluster and a geometric center of the second cluster after the subset of data points 
is moved, wherein a geometric center of the subset of data points is used in the 
determination. 

44. (New) A computer-implemented method for clustering data points 
comprising: 

receiving in a oomputer system & plurality of datsi points for clustering; 
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partitioning the plurality of data points Into a plurality of clusters; and 
repart'rtioning the plurality of data points among the plurality of clusters, 
wherein repartitioning comprises: 

using a predetermined metric to evaluate subsets of data points in 
each cluster of the plurality of clusters for moving into every 
other cluster of the plurality of clusters, wherein the number 
of data points in each subset is determined by a size 
parameter; and 

moving data points in a subset of data points simultaneously from a 
first cluster of the plurality of clusters to a second cluster of 
the plurality of clusters if the evaluation of the subset 
determines that tf)e subset should be moved into the second 
cluster 

45. (New) The computer-implemented method of claim 44, further comprising 
detemiining whether the repartitioning Is satisfactory; 

if the repartitioning is not satisfactory, performing: 
changing the size parameter, 
repartitioning the plurality of data points; and 
determining whether the repartitioning is satisfactory 

until the repartitioning is satisfactory. 

46. (New) The computer-implements method of claim 45, wherein 
determining whether the repartitioning is satisfactory comprises detemiining 
whether at least one data point was moved during the repartitioning. 

47. (New) The computer-Implemented method of claim 45, wherein 
determining whether the repartitioning is satisfactory comprises detemnining 
whether a change in a perfomnance function is less than a predetermined value. 
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48. (New) The computeMmplemented method of claim 44, wherein moving 
data points in a subset of data points simultaneously further comprises: 

updating counts of the first cluster and the second cluster; and 
recalculating geometric centers of the first cluster and the second cluster 

49. (New) The computer-implemented method of claim 44, wherein the 
predetermined metric is computed using at least the size parameter, geometric 
centers of the subset being evaluated, the first cluster, and the second cluster, 
and counts of the first duster and the second cluster. 
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